Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 1 065 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 141.6 KiB |
| Average record size in memory | 136.1 B |
Variable types
| NUM | 15 |
|---|---|
| CAT | 2 |
Paris_max is highly correlated with Paris_min and 1 other fields | High correlation |
Paris_min is highly correlated with Paris_max and 1 other fields | High correlation |
Paris_avg is highly correlated with Paris_min and 1 other fields | High correlation |
reactions is highly correlated with retweetCount and 1 other fields | High correlation |
retweetCount is highly correlated with reactions | High correlation |
likeCount is highly correlated with reactions | High correlation |
reactionsAvg is highly correlated with retweetAvg and 1 other fields | High correlation |
retweetAvg is highly correlated with reactionsAvg | High correlation |
likeAvg is highly correlated with reactionsAvg | High correlation |
quoteAvg is highly skewed (γ1 = 21.88984396) | Skewed |
date has unique values | Unique |
quoteCount has 86 (8.1%) zeros | Zeros |
quoteAvg has 92 (8.6%) zeros | Zeros |
contest has 765 (71.8%) zeros | Zeros |
Reproduction
| Analysis started | 2021-01-14 16:43:51.915944 |
|---|---|
| Analysis finished | 2021-01-14 16:44:25.716474 |
| Duration | 33.8 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 1065 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.3 KiB |
| 2019-12-25 | 1 |
|---|---|
| 2018-03-17 | 1 |
| 2018-12-08 | 1 |
| 2019-05-15 | 1 |
| 2020-10-15 | 1 |
| Other values (1060) |
| Value | Count | Frequency (%) | |
| 2019-12-25 | 1 | 0.1% | |
| 2018-03-17 | 1 | 0.1% | |
| 2018-12-08 | 1 | 0.1% | |
| 2019-05-15 | 1 | 0.1% | |
| 2020-10-15 | 1 | 0.1% | |
| 2018-09-20 | 1 | 0.1% | |
| 2020-07-25 | 1 | 0.1% | |
| 2019-11-23 | 1 | 0.1% | |
| 2019-12-07 | 1 | 0.1% | |
| 2020-03-11 | 1 | 0.1% | |
| Other values (1055) | 1055 | 99.1% |
Unique
| Unique | 1065 ? |
|---|---|
| Unique (%) | 100.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Paris_weather
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.3 KiB |
| 3 | |
|---|---|
| 2 | |
| 1 | 59 |
| 0 | 4 |
| Value | Count | Frequency (%) | |
| 3 | 594 | 55.8% | |
| 2 | 408 | 38.3% | |
| 1 | 59 | 5.5% | |
| 0 | 4 | 0.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
| Distinct | 38 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.55868545 |
|---|---|
| Minimum | -5 |
| Maximum | 32 |
| Zeros | 10 |
| Zeros (%) | 0.9% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | -5 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 7 |
| median | 12 |
| Q3 | 18 |
| 95-th percentile | 23 |
| Maximum | 32 |
| Range | 37 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 6.699131786 |
|---|---|
| Coefficient of variation (CV) | 0.5334261946 |
| Kurtosis | -0.5646446869 |
| Mean | 12.55868545 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.1190360116 |
| Sum | 13375 |
| Variance | 44.87836669 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 7 | 73 | 6.9% | |
| 12 | 61 | 5.7% | |
| 9 | 59 | 5.5% | |
| 8 | 57 | 5.4% | |
| 10 | 54 | 5.1% | |
| 19 | 53 | 5.0% | |
| 16 | 52 | 4.9% | |
| 15 | 50 | 4.7% | |
| 13 | 49 | 4.6% | |
| 18 | 48 | 4.5% | |
| Other values (28) | 509 | 47.8% |
| Value | Count | Frequency (%) | |
| -5 | 1 | 0.1% | |
| -4 | 2 | 0.2% | |
| -3 | 4 | 0.4% | |
| -2 | 1 | 0.1% | |
| -1 | 9 | 0.8% |
| Value | Count | Frequency (%) | |
| 32 | 1 | 0.1% | |
| 31 | 1 | 0.1% | |
| 30 | 2 | 0.2% | |
| 29 | 2 | 0.2% | |
| 28 | 3 | 0.3% |
| Distinct | 42 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.30985915 |
|---|---|
| Minimum | -2 |
| Maximum | 40 |
| Zeros | 2 |
| Zeros (%) | 0.2% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 11 |
| median | 17 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 40 |
| Range | 42 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 7.810234422 |
|---|---|
| Coefficient of variation (CV) | 0.4512015004 |
| Kurtosis | -0.6476254895 |
| Mean | 17.30985915 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.1752608674 |
| Sum | 18435 |
| Variance | 60.99976173 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 11 | 65 | 6.1% | |
| 10 | 59 | 5.5% | |
| 25 | 58 | 5.4% | |
| 23 | 48 | 4.5% | |
| 22 | 47 | 4.4% | |
| 15 | 47 | 4.4% | |
| 8 | 46 | 4.3% | |
| 12 | 46 | 4.3% | |
| 20 | 46 | 4.3% | |
| 24 | 43 | 4.0% | |
| Other values (32) | 560 | 52.6% |
| Value | Count | Frequency (%) | |
| -2 | 1 | 0.1% | |
| -1 | 1 | 0.1% | |
| 0 | 2 | 0.2% | |
| 1 | 6 | 0.6% | |
| 2 | 6 | 0.6% |
| Value | Count | Frequency (%) | |
| 40 | 1 | 0.1% | |
| 38 | 3 | 0.3% | |
| 37 | 4 | 0.4% | |
| 36 | 4 | 0.4% | |
| 35 | 4 | 0.4% |
| Distinct | 71 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.9342723 |
|---|---|
| Minimum | -3 |
| Maximum | 36 |
| Zeros | 3 |
| Zeros (%) | 0.3% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | -3 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 9.5 |
| median | 14.5 |
| Q3 | 20.5 |
| 95-th percentile | 26.5 |
| Maximum | 36 |
| Range | 39 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 7.169792199 |
|---|---|
| Coefficient of variation (CV) | 0.4800898266 |
| Kurtosis | -0.6280821958 |
| Mean | 14.9342723 |
| Median Absolute Deviation (MAD) | 5.5 |
| Skewness | 0.1407584137 |
| Sum | 15905 |
| Variance | 51.40592017 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 9.5 | 41 | 3.8% | |
| 8.5 | 34 | 3.2% | |
| 21 | 30 | 2.8% | |
| 10.5 | 29 | 2.7% | |
| 12 | 29 | 2.7% | |
| 20 | 29 | 2.7% | |
| 15 | 28 | 2.6% | |
| 15.5 | 27 | 2.5% | |
| 11 | 27 | 2.5% | |
| 17.5 | 26 | 2.4% | |
| Other values (61) | 765 | 71.8% |
| Value | Count | Frequency (%) | |
| -3 | 2 | 0.2% | |
| -1.5 | 3 | 0.3% | |
| -1 | 2 | 0.2% | |
| 0 | 3 | 0.3% | |
| 0.5 | 5 | 0.5% |
| Value | Count | Frequency (%) | |
| 36 | 1 | 0.1% | |
| 34.5 | 1 | 0.1% | |
| 33 | 1 | 0.1% | |
| 32.5 | 5 | 0.5% | |
| 32 | 3 | 0.3% |
tweetCount
Real number (ℝ≥0)
| Distinct | 60 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.9342723 |
|---|---|
| Minimum | 1 |
| Maximum | 137 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 10 |
| Q3 | 21 |
| 95-th percentile | 38 |
| Maximum | 137 |
| Range | 136 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 12.46368405 |
|---|---|
| Coefficient of variation (CV) | 0.8345692245 |
| Kurtosis | 9.769656778 |
| Mean | 14.9342723 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.092820484 |
| Sum | 15905 |
| Variance | 155.3434202 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 7 | 86 | 8.1% | |
| 5 | 73 | 6.9% | |
| 6 | 72 | 6.8% | |
| 9 | 70 | 6.6% | |
| 8 | 70 | 6.6% | |
| 10 | 58 | 5.4% | |
| 4 | 53 | 5.0% | |
| 3 | 35 | 3.3% | |
| 11 | 34 | 3.2% | |
| 12 | 34 | 3.2% | |
| Other values (50) | 480 | 45.1% |
| Value | Count | Frequency (%) | |
| 1 | 18 | 1.7% | |
| 2 | 23 | 2.2% | |
| 3 | 35 | 3.3% | |
| 4 | 53 | 5.0% | |
| 5 | 73 | 6.9% |
| Value | Count | Frequency (%) | |
| 137 | 1 | 0.1% | |
| 74 | 1 | 0.1% | |
| 71 | 1 | 0.1% | |
| 66 | 1 | 0.1% | |
| 63 | 1 | 0.1% |
replyCount
Real number (ℝ≥0)
| Distinct | 351 |
|---|---|
| Distinct (%) | 33.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 240.7352113 |
|---|---|
| Minimum | 0 |
| Maximum | 19166 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 21 |
| median | 49 |
| Q3 | 140 |
| 95-th percentile | 806.6 |
| Maximum | 19166 |
| Range | 19166 |
| Interquartile range (IQR) | 119 |
Descriptive statistics
| Standard deviation | 968.2938212 |
|---|---|
| Coefficient of variation (CV) | 4.02223595 |
| Kurtosis | 180.1601348 |
| Mean | 240.7352113 |
| Median Absolute Deviation (MAD) | 35 |
| Skewness | 11.67079742 |
| Sum | 256383 |
| Variance | 937592.9242 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 14 | 25 | 2.3% | |
| 17 | 19 | 1.8% | |
| 18 | 18 | 1.7% | |
| 10 | 18 | 1.7% | |
| 16 | 17 | 1.6% | |
| 29 | 17 | 1.6% | |
| 11 | 16 | 1.5% | |
| 8 | 16 | 1.5% | |
| 22 | 16 | 1.5% | |
| 6 | 16 | 1.5% | |
| Other values (341) | 887 | 83.3% |
| Value | Count | Frequency (%) | |
| 0 | 1 | 0.1% | |
| 1 | 5 | 0.5% | |
| 2 | 2 | 0.2% | |
| 3 | 6 | 0.6% | |
| 4 | 8 | 0.8% |
| Value | Count | Frequency (%) | |
| 19166 | 1 | 0.1% | |
| 12880 | 1 | 0.1% | |
| 9485 | 1 | 0.1% | |
| 7889 | 1 | 0.1% | |
| 6741 | 1 | 0.1% |
| Distinct | 458 |
|---|---|
| Distinct (%) | 43.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 915.1568075 |
|---|---|
| Minimum | 0 |
| Maximum | 71654 |
| Zeros | 3 |
| Zeros (%) | 0.3% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 33 |
| median | 93 |
| Q3 | 252 |
| 95-th percentile | 1750.6 |
| Maximum | 71654 |
| Range | 71654 |
| Interquartile range (IQR) | 219 |
Descriptive statistics
| Standard deviation | 5008.105393 |
|---|---|
| Coefficient of variation (CV) | 5.472401398 |
| Kurtosis | 98.75307096 |
| Mean | 915.1568075 |
| Median Absolute Deviation (MAD) | 73 |
| Skewness | 9.354410085 |
| Sum | 974642 |
| Variance | 25081119.63 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 16 | 14 | 1.3% | |
| 22 | 13 | 1.2% | |
| 20 | 13 | 1.2% | |
| 28 | 12 | 1.1% | |
| 18 | 12 | 1.1% | |
| 27 | 12 | 1.1% | |
| 14 | 11 | 1.0% | |
| 11 | 11 | 1.0% | |
| 7 | 11 | 1.0% | |
| 72 | 10 | 0.9% | |
| Other values (448) | 946 | 88.8% |
| Value | Count | Frequency (%) | |
| 0 | 3 | 0.3% | |
| 1 | 4 | 0.4% | |
| 2 | 3 | 0.3% | |
| 3 | 4 | 0.4% | |
| 4 | 6 | 0.6% |
| Value | Count | Frequency (%) | |
| 71654 | 1 | 0.1% | |
| 60415 | 1 | 0.1% | |
| 55466 | 1 | 0.1% | |
| 52301 | 1 | 0.1% | |
| 48772 | 1 | 0.1% |
| Distinct | 794 |
|---|---|
| Distinct (%) | 74.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1259.823474 |
|---|---|
| Minimum | 15 |
| Maximum | 28362 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 120 |
| Q1 | 301 |
| median | 596 |
| Q3 | 1193 |
| 95-th percentile | 4388.6 |
| Maximum | 28362 |
| Range | 28347 |
| Interquartile range (IQR) | 892 |
Descriptive statistics
| Standard deviation | 2414.301019 |
|---|---|
| Coefficient of variation (CV) | 1.916380405 |
| Kurtosis | 54.08335345 |
| Mean | 1259.823474 |
| Median Absolute Deviation (MAD) | 362 |
| Skewness | 6.409173438 |
| Sum | 1341712 |
| Variance | 5828849.412 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 162 | 5 | 0.5% | |
| 208 | 4 | 0.4% | |
| 175 | 4 | 0.4% | |
| 433 | 4 | 0.4% | |
| 277 | 4 | 0.4% | |
| 146 | 4 | 0.4% | |
| 353 | 4 | 0.4% | |
| 742 | 4 | 0.4% | |
| 236 | 3 | 0.3% | |
| 454 | 3 | 0.3% | |
| Other values (784) | 1026 | 96.3% |
| Value | Count | Frequency (%) | |
| 15 | 1 | 0.1% | |
| 16 | 1 | 0.1% | |
| 24 | 1 | 0.1% | |
| 26 | 1 | 0.1% | |
| 29 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 28362 | 1 | 0.1% | |
| 28360 | 1 | 0.1% | |
| 26416 | 1 | 0.1% | |
| 19836 | 1 | 0.1% | |
| 19151 | 1 | 0.1% |
| Distinct | 117 |
|---|---|
| Distinct (%) | 11.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.93802817 |
|---|---|
| Minimum | 0 |
| Maximum | 5751 |
| Zeros | 86 |
| Zeros (%) | 8.1% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 6 |
| Q3 | 15 |
| 95-th percentile | 74.2 |
| Maximum | 5751 |
| Range | 5751 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 261.2853745 |
|---|---|
| Coefficient of variation (CV) | 6.710287776 |
| Kurtosis | 299.1708256 |
| Mean | 38.93802817 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 15.984958 |
| Sum | 41469 |
| Variance | 68270.04691 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 110 | 10.3% | |
| 1 | 92 | 8.6% | |
| 0 | 86 | 8.1% | |
| 3 | 84 | 7.9% | |
| 4 | 74 | 6.9% | |
| 5 | 57 | 5.4% | |
| 6 | 57 | 5.4% | |
| 9 | 39 | 3.7% | |
| 10 | 35 | 3.3% | |
| 7 | 33 | 3.1% | |
| Other values (107) | 398 | 37.4% |
| Value | Count | Frequency (%) | |
| 0 | 86 | 8.1% | |
| 1 | 92 | 8.6% | |
| 2 | 110 | 10.3% | |
| 3 | 84 | 7.9% | |
| 4 | 74 | 6.9% |
| Value | Count | Frequency (%) | |
| 5751 | 1 | 0.1% | |
| 4385 | 1 | 0.1% | |
| 2388 | 1 | 0.1% | |
| 2076 | 1 | 0.1% | |
| 2063 | 1 | 0.1% |
| Distinct | 869 |
|---|---|
| Distinct (%) | 81.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2454.653521 |
|---|---|
| Minimum | 16 |
| Maximum | 106632 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 146.6 |
| Q1 | 373 |
| median | 799 |
| Q3 | 1679 |
| 95-th percentile | 7281.8 |
| Maximum | 106632 |
| Range | 106616 |
| Interquartile range (IQR) | 1306 |
Descriptive statistics
| Standard deviation | 8004.028325 |
|---|---|
| Coefficient of variation (CV) | 3.260756867 |
| Kurtosis | 86.4412054 |
| Mean | 2454.653521 |
| Median Absolute Deviation (MAD) | 506 |
| Skewness | 8.598996704 |
| Sum | 2614206 |
| Variance | 64064469.43 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 466 | 5 | 0.5% | |
| 211 | 5 | 0.5% | |
| 428 | 4 | 0.4% | |
| 196 | 4 | 0.4% | |
| 882 | 4 | 0.4% | |
| 431 | 3 | 0.3% | |
| 536 | 3 | 0.3% | |
| 448 | 3 | 0.3% | |
| 251 | 3 | 0.3% | |
| 322 | 3 | 0.3% | |
| Other values (859) | 1028 | 96.5% |
| Value | Count | Frequency (%) | |
| 16 | 1 | 0.1% | |
| 17 | 1 | 0.1% | |
| 25 | 1 | 0.1% | |
| 30 | 1 | 0.1% | |
| 34 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 106632 | 1 | 0.1% | |
| 96783 | 1 | 0.1% | |
| 95701 | 1 | 0.1% | |
| 80334 | 1 | 0.1% | |
| 70539 | 1 | 0.1% |
replyAvg
Real number (ℝ≥0)
| Distinct | 297 |
|---|---|
| Distinct (%) | 27.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.71492958 |
|---|---|
| Minimum | 0 |
| Maximum | 538 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.1 |
| Q1 | 2.2 |
| median | 4.2 |
| Q3 | 9.6 |
| 95-th percentile | 57.88 |
| Maximum | 538 |
| Range | 538 |
| Interquartile range (IQR) | 7.4 |
Descriptive statistics
| Standard deviation | 47.98885123 |
|---|---|
| Coefficient of variation (CV) | 3.053710868 |
| Kurtosis | 63.86485253 |
| Mean | 15.71492958 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | 7.384708025 |
| Sum | 16736.4 |
| Variance | 2302.929843 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 32 | 3.0% | |
| 1.8 | 26 | 2.4% | |
| 1 | 24 | 2.3% | |
| 3 | 24 | 2.3% | |
| 1.6 | 24 | 2.3% | |
| 2.8 | 21 | 2.0% | |
| 1.4 | 20 | 1.9% | |
| 1.9 | 20 | 1.9% | |
| 2.2 | 19 | 1.8% | |
| 1.7 | 19 | 1.8% | |
| Other values (287) | 836 | 78.5% |
| Value | Count | Frequency (%) | |
| 0 | 1 | 0.1% | |
| 0.4 | 1 | 0.1% | |
| 0.5 | 3 | 0.3% | |
| 0.6 | 3 | 0.3% | |
| 0.7 | 4 | 0.4% |
| Value | Count | Frequency (%) | |
| 538 | 1 | 0.1% | |
| 523.2 | 1 | 0.1% | |
| 518 | 1 | 0.1% | |
| 487 | 1 | 0.1% | |
| 442.1 | 1 | 0.1% |
| Distinct | 399 |
|---|---|
| Distinct (%) | 37.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.9741784 |
|---|---|
| Minimum | 0 |
| Maximum | 4856.4 |
| Zeros | 4 |
| Zeros (%) | 0.4% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.5 |
| Q1 | 3.6 |
| median | 7.5 |
| Q3 | 20.4 |
| 95-th percentile | 122.98 |
| Maximum | 4856.4 |
| Range | 4856.4 |
| Interquartile range (IQR) | 16.8 |
Descriptive statistics
| Standard deviation | 334.2173081 |
|---|---|
| Coefficient of variation (CV) | 5.764933929 |
| Kurtosis | 137.8956533 |
| Mean | 57.9741784 |
| Median Absolute Deviation (MAD) | 5.1 |
| Skewness | 11.18405162 |
| Sum | 61742.5 |
| Variance | 111701.209 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1.8 | 17 | 1.6% | |
| 3.3 | 17 | 1.6% | |
| 4 | 17 | 1.6% | |
| 2.2 | 15 | 1.4% | |
| 2 | 15 | 1.4% | |
| 3.6 | 14 | 1.3% | |
| 3.7 | 14 | 1.3% | |
| 3 | 14 | 1.3% | |
| 2.8 | 13 | 1.2% | |
| 3.2 | 13 | 1.2% | |
| Other values (389) | 916 | 86.0% |
| Value | Count | Frequency (%) | |
| 0 | 4 | 0.4% | |
| 0.1 | 2 | 0.2% | |
| 0.4 | 1 | 0.1% | |
| 0.6 | 4 | 0.4% | |
| 0.7 | 3 | 0.3% |
| Value | Count | Frequency (%) | |
| 4856.4 | 1 | 0.1% | |
| 4641.4 | 1 | 0.1% | |
| 4433.8 | 1 | 0.1% | |
| 4358.4 | 1 | 0.1% | |
| 2985.6 | 1 | 0.1% |
| Distinct | 754 |
|---|---|
| Distinct (%) | 70.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98.50892019 |
|---|---|
| Minimum | 2.7 |
| Maximum | 2204 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 2.7 |
|---|---|
| 5-th percentile | 15.4 |
| Q1 | 29.2 |
| median | 52 |
| Q3 | 101.7 |
| 95-th percentile | 305.24 |
| Maximum | 2204 |
| Range | 2201.3 |
| Interquartile range (IQR) | 72.5 |
Descriptive statistics
| Standard deviation | 170.8700003 |
|---|---|
| Coefficient of variation (CV) | 1.734563732 |
| Kurtosis | 61.11687198 |
| Mean | 98.50892019 |
| Median Absolute Deviation (MAD) | 27.9 |
| Skewness | 6.754999647 |
| Sum | 104912 |
| Variance | 29196.557 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 24.2 | 6 | 0.6% | |
| 28.7 | 5 | 0.5% | |
| 24.1 | 5 | 0.5% | |
| 39 | 5 | 0.5% | |
| 49 | 5 | 0.5% | |
| 34.7 | 5 | 0.5% | |
| 34.2 | 5 | 0.5% | |
| 16.6 | 4 | 0.4% | |
| 52.2 | 4 | 0.4% | |
| 28.8 | 4 | 0.4% | |
| Other values (744) | 1017 | 95.5% |
| Value | Count | Frequency (%) | |
| 2.7 | 1 | 0.1% | |
| 6.6 | 1 | 0.1% | |
| 7.4 | 1 | 0.1% | |
| 7.5 | 1 | 0.1% | |
| 8.3 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 2204 | 1 | 0.1% | |
| 2108.6 | 1 | 0.1% | |
| 1609.1 | 1 | 0.1% | |
| 1595.9 | 1 | 0.1% | |
| 1525.4 | 1 | 0.1% |
| Distinct | 103 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.609014085 |
|---|---|
| Minimum | 0 |
| Maximum | 479.2 |
| Zeros | 92 |
| Zeros (%) | 8.6% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.2 |
| median | 0.5 |
| Q3 | 1.1 |
| 95-th percentile | 6.46 |
| Maximum | 479.2 |
| Range | 479.2 |
| Interquartile range (IQR) | 0.9 |
Descriptive statistics
| Standard deviation | 17.05283953 |
|---|---|
| Coefficient of variation (CV) | 6.536123982 |
| Kurtosis | 582.1100901 |
| Mean | 2.609014085 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 21.88984396 |
| Sum | 2778.6 |
| Variance | 290.799336 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.2 | 128 | 12.0% | |
| 0.3 | 107 | 10.0% | |
| 0.4 | 95 | 8.9% | |
| 0 | 92 | 8.6% | |
| 0.5 | 82 | 7.7% | |
| 0.1 | 77 | 7.2% | |
| 0.6 | 64 | 6.0% | |
| 0.8 | 44 | 4.1% | |
| 0.7 | 39 | 3.7% | |
| 1 | 38 | 3.6% | |
| Other values (93) | 299 | 28.1% |
| Value | Count | Frequency (%) | |
| 0 | 92 | 8.6% | |
| 0.1 | 77 | 7.2% | |
| 0.2 | 128 | 12.0% | |
| 0.3 | 107 | 10.0% | |
| 0.4 | 95 | 8.9% |
| Value | Count | Frequency (%) | |
| 479.2 | 1 | 0.1% | |
| 142.3 | 1 | 0.1% | |
| 118.5 | 1 | 0.1% | |
| 90.1 | 1 | 0.1% | |
| 86.5 | 1 | 0.1% |
| Distinct | 810 |
|---|---|
| Distinct (%) | 76.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 174.809108 |
|---|---|
| Minimum | 3.3 |
| Maximum | 7511 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 3.3 |
|---|---|
| 5-th percentile | 19.64 |
| Q1 | 36.7 |
| median | 67.8 |
| Q3 | 135.6 |
| 95-th percentile | 500.72 |
| Maximum | 7511 |
| Range | 7507.7 |
| Interquartile range (IQR) | 98.9 |
Descriptive statistics
| Standard deviation | 526.7717668 |
|---|---|
| Coefficient of variation (CV) | 3.013411446 |
| Kurtosis | 119.2081815 |
| Mean | 174.809108 |
| Median Absolute Deviation (MAD) | 38.5 |
| Skewness | 10.05089503 |
| Sum | 186171.7 |
| Variance | 277488.4943 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 40.8 | 4 | 0.4% | |
| 51.6 | 4 | 0.4% | |
| 66.6 | 4 | 0.4% | |
| 56.5 | 4 | 0.4% | |
| 21.7 | 4 | 0.4% | |
| 25 | 4 | 0.4% | |
| 27.1 | 4 | 0.4% | |
| 46.4 | 4 | 0.4% | |
| 89 | 4 | 0.4% | |
| 24.7 | 4 | 0.4% | |
| Other values (800) | 1025 | 96.2% |
| Value | Count | Frequency (%) | |
| 3.3 | 1 | 0.1% | |
| 9.2 | 1 | 0.1% | |
| 9.7 | 1 | 0.1% | |
| 9.9 | 1 | 0.1% | |
| 10.1 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 7511 | 1 | 0.1% | |
| 7495.3 | 1 | 0.1% | |
| 6694.5 | 1 | 0.1% | |
| 6412.6 | 1 | 0.1% | |
| 4443 | 1 | 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.523943662 |
|---|---|
| Minimum | 0 |
| Maximum | 12 |
| Zeros | 765 |
| Zeros (%) | 71.8% |
| Memory size | 8.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.138706989 |
|---|---|
| Coefficient of variation (CV) | 2.173338607 |
| Kurtosis | 20.25993441 |
| Mean | 0.523943662 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.673506457 |
| Sum | 558 |
| Variance | 1.296653606 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 765 | 71.8% | |
| 1 | 174 | 16.3% | |
| 2 | 66 | 6.2% | |
| 3 | 28 | 2.6% | |
| 5 | 13 | 1.2% | |
| 4 | 12 | 1.1% | |
| 6 | 3 | 0.3% | |
| 12 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 8 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 765 | 71.8% | |
| 1 | 174 | 16.3% | |
| 2 | 66 | 6.2% | |
| 3 | 28 | 2.6% | |
| 4 | 12 | 1.1% |
| Value | Count | Frequency (%) | |
| 12 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| 6 | 3 | 0.3% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| date | Paris_weather | Paris_min | Paris_max | Paris_avg | tweetCount | replyCount | retweetCount | likeCount | quoteCount | reactions | replyAvg | retweetAvg | likeAvg | quoteAvg | reactionsAvg | contest | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2018-01-01 | 2 | 7 | 9 | 8.0 | 8 | 22 | 173 | 806 | 3 | 1004 | 2.8 | 21.6 | 100.8 | 0.4 | 125.5 | 0 |
| 1 | 2018-01-02 | 2 | 5 | 11 | 8.0 | 32 | 34 | 117 | 567 | 3 | 721 | 1.1 | 3.7 | 17.7 | 0.1 | 22.5 | 0 |
| 2 | 2018-01-03 | 2 | 9 | 15 | 12.0 | 22 | 411 | 109 | 533 | 26 | 1079 | 18.7 | 5.0 | 24.2 | 1.2 | 49.0 | 0 |
| 3 | 2018-01-04 | 2 | 10 | 14 | 12.0 | 31 | 66 | 344 | 975 | 12 | 1397 | 2.1 | 11.1 | 31.5 | 0.4 | 45.1 | 0 |
| 4 | 2018-01-05 | 2 | 8 | 11 | 9.5 | 23 | 45 | 123 | 568 | 4 | 740 | 2.0 | 5.3 | 24.7 | 0.2 | 32.2 | 0 |
| 5 | 2018-01-06 | 2 | 6 | 7 | 6.5 | 11 | 17 | 87 | 322 | 5 | 431 | 1.5 | 7.9 | 29.3 | 0.5 | 39.2 | 0 |
| 6 | 2018-01-07 | 2 | 6 | 7 | 6.5 | 7 | 23 | 191 | 767 | 5 | 986 | 3.3 | 27.3 | 109.6 | 0.7 | 140.9 | 0 |
| 7 | 2018-01-08 | 3 | 5 | 8 | 6.5 | 29 | 28 | 275 | 1184 | 14 | 1501 | 1.0 | 9.5 | 40.8 | 0.5 | 51.8 | 0 |
| 8 | 2018-01-09 | 2 | 5 | 8 | 6.5 | 21 | 33 | 144 | 570 | 6 | 753 | 1.6 | 6.9 | 27.1 | 0.3 | 35.9 | 0 |
| 9 | 2018-01-10 | 2 | 7 | 8 | 7.5 | 31 | 44 | 126 | 471 | 5 | 646 | 1.4 | 4.1 | 15.2 | 0.2 | 20.8 | 0 |
Last rows
| date | Paris_weather | Paris_min | Paris_max | Paris_avg | tweetCount | replyCount | retweetCount | likeCount | quoteCount | reactions | replyAvg | retweetAvg | likeAvg | quoteAvg | reactionsAvg | contest | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1055 | 2020-11-21 | 3 | 6 | 10 | 8.0 | 7 | 447 | 1200 | 2094 | 242 | 3983 | 63.9 | 171.4 | 299.1 | 34.6 | 569.0 | 1 |
| 1056 | 2020-11-22 | 3 | 7 | 11 | 9.0 | 8 | 85 | 497 | 1077 | 70 | 1729 | 10.6 | 62.1 | 134.6 | 8.8 | 216.1 | 0 |
| 1057 | 2020-11-23 | 2 | 8 | 11 | 9.5 | 8 | 273 | 1095 | 2253 | 216 | 3837 | 34.1 | 136.9 | 281.6 | 27.0 | 479.6 | 2 |
| 1058 | 2020-11-24 | 2 | 7 | 10 | 8.5 | 15 | 87 | 383 | 1280 | 47 | 1797 | 5.8 | 25.5 | 85.3 | 3.1 | 119.8 | 1 |
| 1059 | 2020-11-25 | 2 | 7 | 11 | 9.0 | 10 | 149 | 495 | 1711 | 61 | 2416 | 14.9 | 49.5 | 171.1 | 6.1 | 241.6 | 2 |
| 1060 | 2020-11-26 | 3 | 8 | 12 | 10.0 | 10 | 58 | 438 | 1083 | 40 | 1619 | 5.8 | 43.8 | 108.3 | 4.0 | 161.9 | 0 |
| 1061 | 2020-11-27 | 3 | 8 | 11 | 9.5 | 14 | 322 | 1042 | 2142 | 181 | 3687 | 23.0 | 74.4 | 153.0 | 12.9 | 263.4 | 2 |
| 1062 | 2020-11-28 | 3 | 7 | 12 | 9.5 | 9 | 65 | 408 | 1143 | 63 | 1679 | 7.2 | 45.3 | 127.0 | 7.0 | 186.6 | 0 |
| 1063 | 2020-11-29 | 3 | 6 | 10 | 8.0 | 8 | 1321 | 2320 | 3656 | 494 | 7791 | 165.1 | 290.0 | 457.0 | 61.8 | 973.9 | 0 |
| 1064 | 2020-11-30 | 3 | 2 | 8 | 5.0 | 9 | 100 | 659 | 1397 | 77 | 2233 | 11.1 | 73.2 | 155.2 | 8.6 | 248.1 | 0 |